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Abstract — We consider a general class of wireless scheduling 
and resource allocation problems where the received rate in each 
link is determined by the actions of the transmitter in that link 
along with a single dominant interferer. Such scenarios arise 
in a range of scenarios, particularly in emerging femto- and 
picocellular networks with strong, localized interference. For 
these networks, a utility maximizing scheduler based on loopy 
belief propagation is presented that enables computationally- 
efficient local processing and low communication overhead. Our 
main theoretical result shows that the fixed points of the method 
are provably globally optimal for arbitrary (potentially non- 
convex) rate and utility functions. The methodology thus provides 
globally optimal solutions to a large class of inter-cellular 
interference coordination problems including subband schedul- 
ing, dynamic orthogonalization and beamforming whenever the 
dominant interferer assumption is valid. The paper focuses on 
applications for systems with interference cancellation (IC) and 
suggests a new scheme on optimal rate control, as opposed to 
traditional power control. Simulations are presented in industry 
standard femtocellular network models demonstrate significant 
improvements in rates over simple reuse 1 without IC, and near 
optimal performance of loopy belief propagation for rate selection 
in only one or two iterations. 

Index Terms — femtocells, cellular systems, belief propagation, 
message passing, interference cancellation. 

I. Introduction 

A central challenge in next-generation cellular networks 
is the presence of strong interference. This feature is 
particularly prominent in emerging femto- and pico-cellular 
networks where, due to ad hoc and unplanned deployments, 
restricted association and mobility in small cell geometries, 
mobiles may be exposed to much higher levels of interference 
than in traditional planned macrocellular networks |jT)-|[3]. 
Methods for advanced intercellular interference coordination 
(ICIC) have thus been a key focus of 3GPP LTE-Advanced 
standardization efforts Q and other cellular standards organi- 
zations |5|. 

This work considers a class of general resource allocation 
and scheduling problems where the received rate in each link is 
determined primarily by the actions of the transmitter in that 
link along with a single dominant interferer. Interference in 
traditional macrocellular networks, of course, generally do not 
follow this assumption as there are typically a large number 
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of mobiles per cell, and interference generally arises from 
an aggregate of signals from many sources in the network. 
However, in emerging small cell networks with higher cell 
densities, the number of mobiles per cell is significantly 
reduced and strong interference is most often due to the 
poor placement of the mobile relative to a single victim or 
interfering cell to which the mobile is not connected to. 

For such dominant interference networks, we propose a 
novel ICIC algorithm based on graphical models and message 
passing. Graphical models Q, ||7) are a widely-used tool for 
high-dimensional optimization and Bayesian inference prob- 
lems applicable whenever the objective function or posterior 
distribution factors into terms with small numbers of variables. 
Methods such as loopy belief propagation (BP) then reduce 
the global problem to a sequence of smaller local problems 
associated with each factor. The outputs of the local problems 
are combined via message passing. The loopy BP methodology 
is particularly well-suited to wireless scheduling problems 
since the resulting algorithms are inherently distributed and 
require only local computations. Indeed, |8| showed that many 
widely-used network routing, congestion control and power 
control algorithms can be interpreted as instances of the sum- 
product variant of loopy BP. More recently, f9l, flO] used BP 
techniques for networks with contention graphs, [11] proposed 
BP for MIMO systems, and |T2) , p3) considered a general 
class of ICIC problems with weak interference exploiting an 
approximate BP technique in [ [T4| . 

For dominant interferer networks considered in this paper, 
we apply loopy BP methods to a general network utility 
maximization problem where the goal is to maximize a sum 
of utilities across a system with n links. The utility in each 
link is assumed to be a function of a scheduling decision on 
the transmitter of that link, as well as the decision of at most 
one interfering link. Aside from this single dominant inter- 
ferer assumption, the model is completely general, and can 
incorporate arbitrary channel relationships as well as sophis- 
ticated transmission schemes including subband scheduling, 
beamforming and dynamic orthogonalization - key features 
of advanced cellular technologies such as 3GPP LTE | [T5| . In 
addition, computing maximum weighted matching for queue 
stability |16J and maximization of sum utility of average rates 
for fairness | [T7| can be incorporated into this formulation. 

We show that for single dominant interferer networks, loopy 
BP admits a particularly simple implementation: scheduling 
decisions can be computed with a small number of rounds 
of message passing, where each round involves simple local 
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computations at the nodes along with communication only 
between the transmitter, receiver and dominant interferers in 
each link. Our simulations in commercial cellular models 
indicate near optimal performance in very small, sometimes 
only one or two, rounds - making the algorithms extremely 
attractive for practical implementations. 

Moreover, we establish rigorously (Theorem [T]) that any 
fixed point of the loopy BP algorithm is guaranteed to be 
globally optimal. Remarkably, this result relies solely on 
the dominant interferer assumption and otherwise applies to 
arbitrary utility functions and relationships between scheduling 
decisions on interfering and victim links. In particular, this 
optimality holds even for non-convex problems. Optimality 
results for loopy BP are generally difficult to establish: Aside 
from graphs that are cycle free, loopy BP generally only pro- 
vides approximate solutions that may not be globally optimal. 
Our proof of global optimality in this case rests on showing 
that graphs with single dominant interferers have at most cycle 
in each connected component. The result then follows from a 
well-known optimality property in JTS). 



A. Interference Cancelation and Rate Control 

A target application of our methodology is for scheduling 
problems with interference cancelation (IC) p9] , | [20) . When 
an interfering signal is sufficiently strong, it can be decoded 
and canceled prior to or jointly with the decoding of the 
desired signal, thus eliminating the interference entirely. IC, 
particularly when used in conjunction with techniques such as 
rate splitting |21 1, is known to provide significant performance 
gains in scenarios with strong interference, where it may 
even be optimal or near optimal [22j. IC may also be use- 
ful for mitigating cross-tier interference between short-range 
(e.g. femtocellular) links operating below larger macrocells 
||Z3|. However, while improved computational resources have 
recently made IC implementable in practical receiver circuits 
p4|-| 26 1, it is an open problem of how rates should be 



selected in larger networks when IC is available at the link- 
layer. 

In this paper, we consider a network where the receiver 
in each link can jointly detect the desired signal along with 
at most one interfering link. The limitation of joint detection 
with at most one interfering link is reasonable since the 
likelihood of having two very strong interferers is low in 
most practical scenarios. The limit is also desirable since the 
computationally complexity at the receiver grows significantly 
with each additional link to perform joint detection with. 
For networks with IC, we propose to perform interference 
coordination based on rate control rather than traditional 
power control that has been the dominant method in cellular 
systems without IC |[27). Specifically, each transmitter operates 
at a fixed power, and the system utility is controlled by 
rate: increasing the rate improves the utility to the desired 
user, while decreasing the rate makes the transmission more 
"decodable" and hence "cancelable" at the receiver in any 
victim link. Hence, we suggest that in networks with IC, rate 
control, as opposed to power control, can be viable method 
for interference coordination. 



II. System Model 

We consider a system with n links, each link i having one 
transmitter, TXi and one receiver, RXi. In a cellular system, 
multiple logical links may be associated with the base station, 
either in the downlink or uplink. Each link i is to make some 
scheduling decision, meaning a selection of some variable Xi e 
Xi for some set X^. The scheduling decision could include 
choices, for example, of power, beamforming directions, rate 
or vectors of these quantities in the cases of multiple subbands 
- the model is general. Associated with each RXi is a utility 
function representing some value or quality of service obtained 
by RXi. We assume that the utility function has the form 

where a{i) E {1, . . . ,n} is the index of one link other than i, 
representing the index of a dominant interferer to RXi. Thus, 
the assumption is that utility on each link i is a function of the 
scheduling decisions of the serving transmitter, TXi, and one 
dominant interfering transmitter, TXj for j = a{i). Generally, 
the dominant interferer with be the transmitter, other than the 
serving transmitter, with the lowest path loss to the receiver. 
The problem is to find the optimal solution 



X argmax_F(x), 

X 

where the objective function is the sum utility, 

n 



(1) 



(2) 



The utility function /(xi, ^^.(i)) accounts for the link-layer 
conditions that determine the rates and along with quality- 
of-service (QoS) requirements for valuation of the traffic. A 
general treatment of utility functions for scheduling problems 
can be found in p8[ , p9) . The optimization problem ([T]i can 
be applied to both static and dynamic problems. 

For static optimization, the scheduling vectors Xi are se- 
lected once for a long time period and the utility function is 
typically of the form 

= Ui{Ri{xi,x„(^i))), (3) 

where Ri{xi,Xcr(i)) is the long-term rate as a function of the 
TX scheduling decision Xi and decision x^i^ij on the dominant 
interferer, while Ui{R) is the utility as a function of the rate. 
The problem formulation above can incorporate any of the 
common utility functions including: Ui{R) = R which results 
in a sum rate optimization; Ui{R) — log(i?) which is the 
proportional fair metric and Ui{R) = —j3R~^ for some /? > 
called an /3-fair utility. Penalties can also be added if there is 
a cost associated with the selection of the TX vector Xi such 
as power. 

To accommodate time-varying channels and traffic loads, 
many cellular systems enable fast dynamic scheduling in time 
slots in the order of 1 to 2 ms. For these systems, the utility 
maximization can be re-run in each time slot. One common 
approach is that in each time slot i = 0, 1, 2 . . ., the scheduler 
uses a utility of the form 



fi{t,Xi{t),x„i^,){t)) = Wi{t)R^{t, {xi{t),x„(^i){t)), 



(4) 
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where Wi{t) is a time-varying weight given by the marginal 
utility 



dR 



(5) 



and Ri{t) is exponentially weighted average rate updated as 

R^{t + l) = {I - a)R,{t) + aR,{t,x^{t),x„^^^{t)), (6) 

where Xi{t) and Scr(i) are the TX scheduling decisions on 
the serving and interfering links at time t. Any maxima of 
the optimization ([TJ with the weighted utility Q is called a 
maximal weight matching. A well-known result of stochastic 
approximation |T7l is that if a — > 0, and the scheduler 
performs the maximum weight matching with the marginal 
utilities (|5]l, then for a large class of processes, the resulting 
average rates will maximize the total utility Ui{Ri{t)). 

The above utilities are designed for infinite backlog queues. 
For delay sensitive traffic, one can take the weights Wi{t) to 
be the queue length or head-of-line delay. Maximal weight 
matching performed with these weights generally results in so- 
called throughput optimal performance p6) . These results also 
apply to multihop networks with the so-called backpressure 
weights. 



A. Systems with IC 

As mentioned in the Introduction, a particularly valuable 
application of the methodology in this paper is for rate selec- 
tion in systems with interference cancelation (IC). To simplify 
the exposition, we consider the following simple model of a 
system with IC: Assume the power of each transmitter, TXj, 
is fixed to some level Pj, and let Gij denote the gain from 
TXj to RXi as let Ni denote the thermal noise at RXi. Without 
IC, the receiver RXi would experience a signal-to-interference- 
and-noise ratio (SINR) given by 



Pi 



Ek^.G.kPk+N, 



(7) 



If the interference is treated as Gaussian noise, and the system 
were to operate at the Shannon capacity, the set of rates Ri 
attainable at RXi would be limited to 



R,<\0g2{l+P^). 



(8) 



We call this set of achievable rates the reuse 1 region, since 
this set is precisely the rates achievable in a cellular system 
operating with frequency reuse 1 (i.e. all transmitters trans- 
mitting across the entire bandwidth) and treating interference 
as noise. We denote the reuse 1 region by Crousoi(i)- 

The addition of IC can be seen as a method to expand 
this rate region. Specifically, suppose each RXi can poten- 
tially jointly detect and cancel the signals from at most one 
interfering transmitter TXj for some index j = o'(i) ^ i. 
The limitation to a single interferer will enable the system 
to fit within the dominant interferer model described above. 
However, this limitation has minimal practical impact since 
jointly detecting more than one interferer is both computa- 
tionally intensive and seldom of much value anyway since the 
presence of strong interference from more than one source is 
rare. 



Now, to compute the region achievable with joint detection 
and IC consider Fig. [T] which shows the receiver RXi being 
served by the transmitter TXi while receiving interference 
from TXj for j — o'(i). In this model, assuming again that 
RXi can operate at the Shannon capacity, it can jointly detect 
the signals from both the serving transmitter TXi and interferer 
TXj, if and only if the rates Ri and Rj satisfy the multiple 
access channel (MAC) conditions pO): 



Ri + R, 



R^ 



< Iog2(l + Pi,0 

< l0g2(l 

< 



l0g2(l +ft,<T(»)), 



where, for £ — i or £ — cr(i), pi^g is the SINR, 

GiePe 



Pi,e 
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We denote this region by Cic(i) which is plotted in the bottom 
panel of Fig. [T| along with reuse 1 region Cicusoi(0- We can 
see that the addition of IC expands the set of achievable in the 
region where the interfering rate is low. In particular, when the 
interfering rate i?cr(i) is sufficiently low (to the left of point 
A in the figure), the achievable rate Ri on the serving link is 
identical to the case without any interference. In this sense, 
reducing the rate makes it more "decodable" or "cancelable". 
We let C(i) denote the total set of feasible rates, which is the 
union of the reuse 1 and IC regions: 

C(i) =Clc(i) UCrousel(«)- 

Note that this region is not convex. 

With these observations, we can now pose the IC problem 
in the dominant interferer model as follows: The decision 
variable at each transmitter TXi is an attempted rate, denoted 
Xi. The achieved rate at the receiver RXi is then simply 



Ri — Ri{x i , X fj {^i'^^ 



Xt, if {xi,x„(^,)) eC{i) 
0, if (xi,x<^(.,)) ^ C(i) 

That is, the rate is achieved rate is equal to the attempted rate 
on the serving link if and only if the serving and interfering 
rates are feasible; otherwise, the achieved rate is zero. Then, 
the optimization can be formulated as ([3]) for any utility 
function Ui{Ri). 

More sophisticated methods, such as power control and 
rate splitting as used in the well-known Han-Kobayashi (HK) 
method |21|, can also be incorporated into this methodology. 
For example, to incorporate the HK technique, the transmitter 
TXi would splits its transmissions into "private" and "public" 
parts and the decision variable Xi for that link would be a 
vector including the private and public rates and their power 
allocations. However, for simplicity, we do not simulate this 
scenario. 

III. Message Passing Algorithms 

A. Graphical Model Formulation 

As mentioned in the Introduction, graphical models provide 
a general and systematic approach for distributed optimization 
problems where the objective function admits a factorization 
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Fig. 1: Rate region with IC: When a receiver RXi is capable of 
IC, its achievable rate depends on both the serving rate rate Ri 
as well as the rate Rcr(i) on the interfering link. The total rate 
region, which includes the options of both reuse 1 and treating 
interference as noise as well as joint detection, is non-convex. 



into terms each with small numbers of variables ||6), ||7|. The 
optimization ([TJ with objective function ^ is ideally suited 
for this methodology. 

To place the optimization problem into the graphical model 
formalism, we define a factor graph G = {V, E), which is an 
undirected bipartite graph whose vertices consists of n variable 
nodes associated with the decision variables Xi, i — 1, . . . ,ti, 
and n factor nodes associated with factors fi, i = 1, . . . ,n. 
There is an edge between xg and fi if and only xg appears as 
an argument in fi - namely if £ — i or £ = a{i). 

As an example of the factor graph consider the network 
in the top panel of Fig. [2] with n = 6 links. Each link has 
a transmitter and receiver and the diagram indicates both the 
serving and interfering links. The factor graph representation 
is shown in the bottom panel where there is one variable, Xi, 
and one factor, fi, for each link i ^ 1, . . . ,n. 

B. Max-Sum Loopy BP 

Once the optimization problem has been formulated as a 
graphical model, we can apply the standard max-sum loopy BP 
algorithm. A general description of the max-sum algorithm can 
be found in any standard graphical models text such as |j6), |[7). 
For the dominant interferer optimization problem, the max- 
sum algorithm can be implemented as shown in Algorithm 

m 

Loopy BP is based on iteratively passing "messages" be- 
tween the variables and factor nodes. For the interferer prob- 
lem, the messages are passed between the receivers RXi and 
transmitters TXj where TXj is either the serving transmitter 
or dominant interferer of RXi. 

Each message is a function of the scheduUng decision Xj. 
The message function from TXj to RXi is denoted pj^i{xj) 
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□ ri ) 



Tx nodes Rx nodes 
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Fig. 2: Example factor grapli representation of a network: 

The top panel shows an example network with rt = 6 links, each 
with a transmitter and receiver. Solid lines indicate serving links 
and dotted lines indicate interfering links. The bottom panel 
shows its factor graph representation. 



and represents an estimate of the value of TXj selecting the 
scheduling decision Xj. This message can thus be interpreted 
as a sort of "soft" request-to-send. We use the term "soft" 
since the message ascribes a value to each possible scheduling 
decision. Similarly, the messages from RXi to TXj is denoted 
fij^i{xj) and has a role has both a "soft" clear-to-send and 
channel quality indicator (CQI). 

As shown in Algorithm [T] the messages at the receiver 
are initialized at zero and then iteratively updated in a set of 
rounds. Each round consists of two halves: In the first half, the 
receivers send messages to the transmitters and, in the second 
half, the transmitters send messages back to the receivers. The 
process is a repeated for a fixed number of iterations, and the 
final messages are used to compute the scheduling decision at 
the transmitters. 

C. Optimality under the Dominant Interferer Assumption 

We will discuss more detailed issues in implementing the 
algorithm momentarily. But, we first establish an important 
optimality, which is the main justification for the algorithm. 

Theorem 1: Consider the message passing max-sum algo- 
rithm. Algorithm [T] applied to the optimization ([TJ with the 
objective function (|2]l for any utility functions fi{xi,Xcr(i)) 
and dominant interferer selection function 17(1). Then, if all 
messages fj,i^j{xi) and yUi<_j(a;j) are fixed-points of the 
algorithm, the resulting scheduling decisions Xj are globally 
optimal solutions to ([T]) in that 

F(x) > F(x), 

for all other scheduling decision vectors x. 

Proof: See Appendix [A| ■ 

The theorem shows that if the max-sum algorithm converges 
to a fixed point, the solution will be optimal. Remarkably, 
this result applies to arbitrary utility functions as long as the 
dominant interferer assumption is valid. In particular, the result 
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Algorithm 1 Max-sum loopy BP for scheduling 

{Initialization at the RX } 
for all RXi do 

f^i^iixi) ^ 

^j^i{xj) ^ for j = cr(z) 
end for 
repeat 

{ Receiver half round } 

for all RXi do 
j ^ a{i) 

vaaxx- fi{xi,Xj) + ^j^i[xj) 
fJ-j^i{xj) maxj;. fi{xi,Xj) + ^i^i{xi) 
end for 

{ Transmitter half round } 
for all TXj do 

for all i s.t. j = a{i) do 

fJ-j^i{xj) ^ Hjixj) - fij^i{xj) 
end for 
end for 

until max number of iterations 

{ Final scheduling decision } 
for all TXj do 

Xj -It- argmaXj.^ Hj{xj) 
end for 



applies to even non-convex utilities such as the ones arising 
in the IC problem. The result is unexpected since loopy BP 
is generally only guaranteed to be optimal in graphs with no 
cycles, and the factor graph we are considering may not be 
acyclic. 

Of course, the theorem does not imply the convergence 
of the algorithm. Indeed, since the graph has cycles, the 
algorithm may not converge. However, the iterations of loopy 
can be "slowed down" using fractional updates with the 
same fixed points to improve convergence at the expense of 
increased number of iterations. However, as we will see in the 
simulations, convergence does not appear to be an issue in the 
cases we examine. 

D. Implementation Considerations 

We conclude with a brief discussion of some of the practical 
considerations in implementing the message-passing algorithm 
in commercial cellular systems. 

• Communication overhead: Most important, it is neces- 
sary to recognize that the message-passing needed for 
the proposed algorithm would require additional control 
channels not present in current cellular standards such 
as LTE or UMTS. New control channels would be 
needed to communicate the rounds of messaging prior 
to each scheduling decision. However, similar messages 
have been proposed in Qualcomm's peer-to-peer system. 



FlashLinQ | |30| , and also appear in some optimized 
CSMA algorithms such as fJTl. Under the dominant 
interferer assumption, the communication overhead may 
particularly small since each receiver must send and 
receive messages to and from at most two transmitters 
in each round (the serving transmitter and the dominant 
interferer). However, each message is an entire function, 
so the message must in principle contain a value for each 
possible Xi. However, if the number of values Xi is small, 
or the function can be well-approximated in some simple 
parametric form, the overall communication per round 
overhead will be low. Moreover, as we will see in the 
simulations, very small numbers of rounds are typically 
needed; sometimes as small as one or two. 

• Computational requirements: The computational require- 
ments are low. The transmitter must simply sum func- 
tions, and the receivers must perform one-dimensional 
optimizations. Thus, as long as the sets of possibilities 
for each Xi is small, the computation will be minimal. 

• Channel and queue state information: The utility 
fi{xi,Xj) is generally based on the rate achievable at 
RXz as a function of the scheduling decisions at both the 
serving transmitter TXz and the dominant interferer TXj. 
Thus, for the receiver to perform the optimization in the 
message passing algorithm, the receiver must generally 
know the channel state from the both the serving and 
interfering channels along with noise from all other 
sources. This channel state can be estimated from pilot or 
reference signals which are already transmitted in most 
cellular systems. However, the utility function will also 
generally depend on the queue state at the transmitter, 
particularly, the priorities and size of packets in the 
transmit buffer. This information must thus be passed, 
somehow, to the receiver Cellular systems such as LTE 
already pass such information in the uplink through buffer 
status reports 1 15 1, however a similar channel would need 
to be placed in the downlink for this message passing 
algorithm to work. 

• Synchronization: Since the message passing loopy BP 
algorithm is based on coordinated scheduling, there is 
an implicit assumption of synchronization. In a system 
such as LTE, this would require synchronization at the 
subframe level, where the subframes are 1ms. Since 
the intended use case for such coordinated scheduling 
applications is for small cell systems (e.g. 100 to 200m 
cell radii), such synchronization accuracy is reasonable. 

IV. Simulation Results 

We validate the max-sum loopy BP algorithm for interfer- 
ence cancellation in two simulations. In both cases, we con- 
sider static optimizations of log utility, U{Ri) = log Ri, which 
is the proportion fair metric |28|. For the link-layer model, 
we assume that the rate region is described by the Shannon 
capacity with a loss of 3dB and there is a maximum spectral 
efficiency of 5 bps/Hz. The maximum spectral efficiency in 
practice is determined by the maximum constellation, which 
itself is usually limited by the receiver dynamic range. In the 
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Parameter 


Value 


Network topology 


4x4 apartment model, with active links in 
10 of the 16 apartments. 


Bandwidth 


5 MHz 


Wall loss 


1 n ATI 


Lognormal shadowing 


10 dB std. dev. 


Path loss 


38.46 + 20 logiQ(iJ) + 0.7i? dB, R distance 
in meters. 


Femto BS TX power 


dBm 


Femto UE noise figure 


4 dB 



TABLE I: Simulation parameters for apartment model. 



implementation of loopy BP, we assume a discretization of the 
rates using logarithmic spacing of the 25 rate points from zero 
to the maximum spectral efficiency. In practice, one should use 
the discrete modulation and coding scheme (MCS) values that 
are available at the link layer In LTE, there are approximately 
30 such MCS values |32|. 

Note that the final rates Xj from Algorithm [T| may not be 
feasible, unless the algorithm has been run to convergence. 
To evaluate the algorithm prior to convergence, we obtain a 
feasible point from the values Xj at the end of the algorithm 
as follows: At each receiver RXi, if {xi,x„(^i-j) G C{i) (that 
is, the rate pair is feasible), we take the rate Ri = Xi. 
Otherwise we take Ri to be the largest value with Ri < Xi and 
{Ri,Xcr{i)) G It is simple to show that the resulting set 
of rates, {Ri, Ra(i)), is feasible for all i. This final projection 
step can be performed with local information only. 

A. 3GPP Femtocellular Apartment Model 

In our first simulation, we use a simplified version of an 
industry standard model |5l for validating ICIC algorithms of 
femtocells in densely packed apartments. The apartments are 
assumed to be laid out in a 4x4 grid, each apartment being 
1 Ox 10m. In 10 of the 16 apartments, we assume there is an 
active link with a femto base station and mobile (user equip- 
ment or UE in 3GPP terminology). The femtos are assumed 
to operate in closed access mode so that a femto UE can 
only connect to the femto base station in its apartment. This 
restricted association is a major source of strong interference 
in femtocellular networks and thus a good test case for IC. 
The complete list of simulation parameters are given on the 
Table M 

Fig. [5] shows the cumulative probability distribution of re- 
sulting spectral efficiencies (bits/sec/Hz) based on 100 random 
drops of the network. In this figure, IC is compared against 
reuse 1 where no IC is used and all links transmit on the same 
bandwidth, treating interference as noise. We see that for cell 
edge users (defined as users in the lowest 10% of rates), there 
is a gain of almost 4 times in the rate by employing IC. Hence, 
IC has the potential for significant value for improving rates in 
such femtocellular scenarios, particularly for mobiles in strong 
interference. 

Also shown in Fig. [5] is a comparison of the optimal rate 
selection against the selection from max-sum loopy BP run for 
4 iterations. We see there is an exact match. In fact, even after 
only 1 iteration there is a minimal difference. The optimal 
rate selection is found as follows: For each RXi, the rate pair 




10 

Rate (bps/Hz) 

Fig. 3: Simulation with 3GPP apartment model: Plotted 
are the distribution of rates (bps/Hz) over 100 drops. We see 
that IC provides significant gains particularly at the cell edge. 
Moreover, even one iteration of max-sum loopy BP finds near- 
optimal rates for the non-convex optimization problem. 
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Fig. 4: Road network model: Using open-access femto- 
cells within apartments may enable low-cost wide-area cellular 
coverage. However, mobility within small cells can result in 
significant interference as handover delays cause mobile users 
to drag into neighboring cells. 



must be either in the reuse 1 region Ciousci(i) or 
the joint detection region Cic(«). Since there are two choices 
for each link, there are a total of 2" choices in the network. 
For each of the 2" choices, the total rate region is convex and 
we can maximize the utility over that region. For the optimal 
rate selection, we repeat the convex optimization for every 
one of 2" choices and select the one with the maximum total 
utility. Obviously, the max-sum loopy BP algorithm is much 
simpler. 

B. Road Network with Mobility 

A second case of strong interference in femto- or pico- 
cellular networks is handover delays. When cells are small, 
handovers are frequent. Also, since femtocells are not be 
directly into the operator's core network |33|, the handovers 
between femtocells or from the femtocells to the macrocell 
may be significantly delayed |3 |. Due to the delays, the mobile 
may drag significantly into cells other than the serving cell, 
exposing the mobile to strong interference. 

To evaluate the ability of IC to mitigate this strong in- 
terference, we considered a road network model shown in 
Fig.|4] A similar model was considered in |34|. In this model. 
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Parameter 


Value 


Network topology 


10 Ix s in/on apartments by the road, 10 
mobile Rx's on the road in periodic 


Road width 


10 m 


Road length 


50 m periodicity 


Apartment dim. 


10 m width and 20 m length 


Bandwidth 


5 MHz 


Wall loss 


dB 


Lognormal shadowing 


10 dB std. dev. 


Path loss 


10 + 371og]^Q(i2) dB, R distance in meters. 


Rx velocity 


15 - 25 m/s 


Femto BS TX power 


dBm 


Femto UE noise figure 


4 dB 



TABLE II: Simulation parameters for road model. 



0. 



P 0. 



0.4 



O 0.2 




10" 

Rate (bps/Hz) 

Fig. 5: Simulation for the road network model: Plotted are 
the distribution of rates (bps/Hz) over 100 drops. Again, we see 
that IC provides significant gains in rates throughout the rates, 
but particularly at the cell edge. Moreover, max-sum loopy BP 
finds near-optimal selections after only 1 iteration. 



the transmitters are femtocells placed in apartments at two 
sides of the road. The receivers are mobile on the road with 
a random velocity between 15 and 25 m/s. Each mobile is 
initially connected to the strongest serving cell. After the 
connection, we let mobiles to move for a time uniformly 
distributed between and 1 second to model the effect of 
delayed handover We then perform a static simulation taken 
at this snapshot in time. The complete list of simulation 
parameters are given on the Table |l] which are loosely based 
on the path loss models in |5|. 

Fig. |5] shows the distribution of rates based on 100 random 
drops. Again, we see that IC provides significant improvement 
in cell edge rate: the 10% rate is increased by more than a 
factor of 3. In fact, even the median rate is increased almost 
twofold. In addition, max-sum loopy BP finds near optimal 
rates. 

V. Conclusions 

A general methodology based on max-sum loopy BP is 
presented for wireless scheduling and resource allocation 
problems in networks where each link has at most one 
dominant interferer The algorithm is extremely general and 
can incorporate advanced link-layer mechanisms available in 
modern cellular systems. Moreover, the method is completely 
distributed, requires low communication overhead and mini- 
mal computation. In addition, although we did not establish 



the convergence of the algorithm, we have shown that if it 
converges, the result is guaranteed to be globally optimal. 
Remarkably, this result holds for arbitrary problems, even 
when the problems are non-convex, the only requirement being 
the dominant interferer assumption. 

The method was applied to systems with IC. We propose 
that when IC is available at the link-layer, rate control as 
opposed to power control can provide an effective mech- 
anism for interference coordination. Moreover, the optimal 
rate selection is performed extremely well by max-sum loopy 
BP. Simulations of the algorithm in femtocellular networks 
demonstrate considerable improvement in capacity from the 
addition of IC for handling both restricted association and 
mobility. In addition, loopy BP appears to provide near optimal 
performance with only one or two rounds of messaging, thus 
making the methodology extremely attractive for practical 
networks. 

Appendix A 
Proof of Theorem[T] 

We begin by reviewing the following well-known result 
from ijTsJ, p5] : Let Gu = {V,Eij) be an undirected graph 
with vertices V = {!,..., n}, and suppose that a function 
F(x) admits a pairwise factorization of the form 

F(x)- J2 ^^o{x^.Xj), (11) 

for some set of functions fij{xi,Xj). Note that the graph Gjj 
is not the factor graph. One can then apply max-sum loopy 
BP to attempt to perform the optimization 



X = argmax_F(x). 



(12) 



It is well-known that max-sum loopy BP will provide the 
optimal solution when Gjj has no cycles. However, our 
analysis requires the following. 



Theorem 2 ( 475yj.- Consider max-sum loopy BP applied to 
the optimization ( [T2] l to an arbitrary pairwise objective func- 
tion of the form ( [TT| ). Suppose that each connected component 
of the undirected graph Gjj has at most one cycle. Then, if 
the belief messages are fixed points of max-sum loopy BP, the 
corresponding solution x is optimal in that F(x) > F(x) for 
all vectors x. 

Now, the function (|2| we are interested in is a special case 
of the form ( [TT] i, so we need to show that the corresponding 
undirected graph, Gjj, has the same single cycle property. The 
undirected graph for the objective function (|2]) is simple to 
describe: First let G^i = {V,Eo) be the directed graph with 
vertices V = {\, . . . ,n} and directed edges 

Ed = i = 1, . . .,n}. 

Then, if Gu = {V,Eu) is the corresponding undirected graph 
to Gd, then it is also the undirected graph for the objective 
function (|2]l. For illustration. Fig. |6] shows the directed graph 
Gb for the example network mentioned earlier in Fig. [2] Note 
that there is a directed edge from j to i if and only if TXj 
is the dominant interferer of RXi. Since each receiver has at 
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Fig. 6: Directed graph for the example network in Fig. l2| 
There is a directed edge from j to i if and only if j — o-{t), 
namely TXj is the dominant interferer of RXi. 

most one dominant interferer, each node in G^i has at most 
one incoming edge. That is, the in-degree is at most one for 
all nodes. This property will be key. 

To apply Theorem |2j we need to show that each connected 
component of Gu has at most one cycle. We begin with the 
following lemma. 

Lemma 1: Consider any path = (ii, 22, ^m), in 

the undirected graph Gu- Then either («2,ii) G Ejy or 
(im-Ii^m) G Ed. That is, either the first or last segments 
of the path are pointing "outwards". 

Proof: The proof follows by induction. For M = 2, either 
(12, ii) or (11,12) must be a directed edge. Now suppose the 
the lemma is true for all paths of length M — 1, and consider 
a path of length M. If (12, ii) € Ejj we are done, so suppose 
(*2,ii) ^ Ed. Then, applying the induction hypothesis to 
Pm-1 shows that (u/-2, Hf-i) must be a directed edge. But, 
since the node iM-i can have at most one incoming edge, 
(*M-i,*A/) must be outgoing and hence in E^. So, we have 
shown that either (12, «i) G Ed or G Ed. ■ 

We wow return to the graph Gjj and show that each 
connected component can have at most one cycle. We prove 
the property by contradiction. Assume there are two loops in 
the same connected component of Gjj' 

L = {ii,i2, ...,iK,H) 
J = O'l, ^2, ji), 

with no repetitions of points within each loop, aside from the 
first and last points. Now there are three possibilities as shown 
in Fig. |7] (a) The two loops have no common point, (b) one 
common point and (c) two or more common points. We show 
each of these possibilities leads to a contradiction. 

Case (a) No common points: By assumption, L and J 
are in the same connected component, so there must be a path 
in the undirected graph between between one point in L and 
a second point in J. Without loss of generality (WLOG), we 
can assume the path goes from ii to ji and we will write the 
path as 

F = (il = fci, fc2, . . . , fcAT = ji). 

We can construct this path such that, aside from ii and ji, 
this path has no common points with the loops L or J. Now, 
by Lemma[T] either (fc2, fci) or {kN-i, k^) is a directed edge 
(i.e. in Ed). WLOG suppose (^2, fci) € Ed', the other case is 
similar. The point fci would correspond to point B in Fig. |7] 
with the loop on the right being loop L. Thus fci = ii has an 




(c) 

Fig. 7: Possible positions of two loops in the same connected 
component. 

incoming edge from fc2 . But Lemma [T] applied to the path L 
implies that there must be an incoming edge from either i2 or 
lA'. When combined with the incoming edge from fc2, ii has 
more than one incoming edge, which is impossible. 

Case (b) One common point: WLOG we can assume that 
the point ii — ji is the common point in loops L and J. The 
point is shown as A in Fig. [7] Applying Lemma [T| to both 
loops L and J show that there must be at least one incoming 
edge into point A from each loop. Thus, A has more than one 
incoming edge, which again is a contradiction. 

Case (c) Two or more common points: Taking a minimal 
common path, and possibly reaiTanging the indexing, we can 
assume that the first N points of the loops L and J are 
common so that 

h ^ ji, . . . ,iN ^ JN, 

but iN+i 7^ Jn+i, and ^ Jm- The situation is depicted 
in Fig. |7j where point A corresponds to ii (= ji) and B 
corresponds to z^v (= Jn)- Applying Lemma [T] to the path 
along the common segment from A to B, 

P= (ii,...,ZAr) = {jl,...jN), 

implies that either {i2,ii) or {iN-i,iN) is a directed edge. 
WLOG assume {iN-i,iN) is a directed edge, creating an 
incoming edge into point B (the point with index ijv = Jn)- 
Since any node can have at most one incoming edge, both the 
edges {iNj'iN+i) in loop L and {jN,jN+i) in loop J must 
be directed edges outgoing from ijy — jiy. Now consider the 
two paths from — back to ii = ji along loops L and 
J. From Lemma [1] since the initial segments, (iAr,ijv+i) in 
loop L and (jjv,ijv+i) in loop J, are directed edges, the final 
segments (j_fs-,zi) and {jM,ji) must also be directed edges. 
Thus, there are two incoming edges into point A (the point 
with index ii = ji). This is a contradiction. 

Thus, all three cases are impossible, and we have proven by 
contradiction that any connected component of Gjj can have 
at most one cycle. We can therefore apply Theorem |2] and the 
proof of Theorem [T] is complete. 
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